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1. Introduction. mek 
Siete. 

A well known test for equality of normal population variances. based on sample vari- e 

rz 
ances, was introduced by Cochran (1941). Suppose o?, i = 1,---,n represents the popula- os ‘ 
WON 
tion variances of n normal populations, and let independent sample variances, each based sd 

. on k degrees of freedom, be s?, i = 1,---,n. Suppose the s? are ranked, so that the ordered xe 

variables are $/,),87);"**sSiqy: To test Ho: the o? are all equal (suppose the common nee 
value is 0%), Cochran (1941) introduced the test statistic es 
2 wy 

S(n) Bot 

2S ieea (1) me 
ae $y Cs 

on 

which compares the largest sample variance with the sum of the sample variances. Clearly Bat 
“ () 

the intent is to discover if one variance is an outlier (too large), and, in general. Ho will * y 
be rejected for large values of Z. However, excessively small values of Z could be useful ee 
also: they will suggest that in some way the sample variances are more homogeneous than oS 
should be expected from a set of normal variances all from equi-variable populations, an me 
event which might occur if samples with widely disperse variances have been omitted from my 
a set of data. Such an event would be important, for example. in monitoring quality or — 2. 
“ 

process control. “st 
La 

Cochran gave some distributional results for Z, and these were used by Eisenhart and nie 

oy 


Solomon (1947) to construct tables for the upper 5% and upper 1% points. 95th and 99th 


percentiles to supplement the very brief table presented by Cochran in his article. Another 
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table with 5% and 1% points. for n < 20 and for slightly different values of k. was given 
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: by Yamauti (1972). The Eisenhart and Solomon tables have been frequently reproduced 


oy 


and we now augment those tables in what follows by producing the additional percentiles 
listed in Table 1 for n = 2(1)10,12(2)20, 25(3)50, 60,120 and k = 1(1)20,30. When k = 2, 


the test statistic serves as the basis of a significance test for any particular term in the 
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harmonic analysis of a series as was demonstrated by Fisher (1929) who also provided a 4 

brief table of 95% and 99% points. In another paper, Fisher (1940) provided another brief 

table that also gave 95% values for the largest and second largest fractions. Tables for my 

the Fisher problem, namely k = 2 were supplemented extensively by Nowroozi (1967) but on 

to our knowledge more exteusive tables for other values of k have nui Lee produced. It an 

might be useful to have such points, and we give them in the tables below. a 
»J 


An obvious application would be to process control or quality control. Historically. 
this was mostly examined using, say, the mean of a sample taken daily, but it has become 


increasingly the practice to examine the variance also, for stability of the process. Thus. for 


example, the seven variances in a week might be examined to see if these were homogeneous, 


Tete DA 


using the test given below. One might not always wish to be limited to comparing the 


ae 


. largest variance with the total, and, for example, the two largest of the week or the four : 
largest of a monthly set of variances, might be tested as too large. The theory given below 7 
can i-e adapted to provide such a test, and work is in progress to provide tables for this an 
more general situation. The test procedure is given in Section 2, followed by theoretical a 
results in Section 3. 3 
2. Test for Equality of n Normal Population Variances. ni 

The test of Ho thus proceeds as follows. > 

1. Suppose for the ith population the sample variance s? is given, (the unbiased estimator Ry 
of a7), based on k degrees of freedom. x 

2. Calculate Z from (1). ~ 
3. Reject Ho given in Section 1 at significance level a if Z > Z_, where Z, is given in 7 
Table 1, for appropriate values n, k, and a. 7 

3’. On occasion, Ho might be rejected at level 1 ~ a if Z is smaller than Z,. Re 
Table 1 has been constructed. for values Z, > 0.5. from an exact formula for upper a 

tail probabilities. given by Cochran (1941) and used by Eisenhart and Solomon (1947): for = 
critical values Z, < 0.5 Pearson curve approximations have been used. The techniques ie 
£ 
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used, and comments on the accuracy of the tables, are given in Section 3. 


3. Theory of the Tests 


3.1. Calculation of critical points Z, > 0.5. Define r; = s?/ 3°, s?. Cochran (1941) 
showed that the probability P(Z > g) is given by 


nin — 1) n(n — 1)(n — 2) f ; 


P(g) = nPylg) — 5 gr Pl) (2) 


where P,(g) is the probability that any one ratio r; exceeds g, P2(g) is the probability that 
two of the ratios both exceed g, etc., and observed that the upper tail probabilities P(g) a 
will be exactly nP,(g) when g exceeds 0.5. Eisenhart and Solomon (1947) showed limits wt r] 
for the accuracy of approximating P(g) by nP\(g) for lower values of g. P(g) is given by “4 


an incomplete Beta function (Cochran, 1941): 


ph x#2-17 — 2) -1 gy 

; NG) SS Paco 0<g<1l (3) oe 
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where B(-,-) is the Beta function. P,(g) can also be evaluated from tables of the F at 
distribution. In Table 1, P,(Z.) has been used to give critical values Z,. when these are roe 

greater than 0.5. 2 

ae 

3.2. Pearson curve approximations for Z, < 0.5. For smaller values of Z, correspond- an 
ing to higher significance levels, we have approximated the distribution of Z by Pearson ee 
curves. For this, the first four moments of Z!/? are used. 2s 

Suppose Z is constructed as follows: Se 
(a) Let yy.y2,:-:.Yn be iid. random variables. each with the distribution o?\2, where ee 

o* is any positive value; let yi) < y(2) < --: < Yn) be the order statistics of the set Q ‘ 

(b) Let ¥ = Syy,. vent 
(c) Then Z = yny/Y¥. TRY 
‘ It is clear that the distribution of Z is independent of o. the scale parameter of y,: ay 
also ¥’ is a completely sufficient statistic for o7. Thus. by the Basu/Hogg/Craig Theorem. aay 

3 WY 


Pe.6 8 0.8 te! 
a ht bg | 


arate ad Barc eee ee Aha Se, . ae PP a re re 
* ‘ : Be Se Se ee Noe ‘ nee. ba 


a ae Pra ae rr er ook fo Fi 
Salta Ener Fae AN 


i a Bd a Swell ag eee tg ey ke ag a acta vctiat wes pew gw uoings, 
Ce ee Te ee fe oe ate ate ath aly 


Z and Y are independently distributed. We can henceforth assume that o = 1. Then 


ZY = yn), and we have 
E(yny) 


where E(.) denotes expectation. The denominator of (5) is easy to find, since Y is a y?- 
variable with kn degrees of freedom: then 


OT {(kn + 2r)/2} 
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For the distribution of y,) suppose G(t) is the distribution of re the distribution of 


yn) is then (G(t)]” = P(yn) < t), and moments are given by 


b= i t"n(G(t))"-" g(t) dt (6) 


where g(t) is the density >f x2. 

Thus the moments of Z or of Z}/? are very easy to calculate, from (4) using (5) and 
(6). The first four moments of Z!/* have been found and used to fit Pearson curves (see 
Solomon and Stephens, 1978) to the distribution of Z!/? and hence to obtain significance 
points Z, for Z. 

3.3. The case when k = 1. 

For & = 1, when each sample variance has only one degree of freedom. there is an 
interesting connection with a distribution in the statistics of directions. Suppose P is 
a point uniformly distributed on the n-sphere with center 0 and radius 1. It is well- 
known (Marsaglia. 1972) that a method to generate such points P is as follows. Generate 
FB then 


X,, 1 = 1,-+-,n are the components of vector OP. Let S, = |.X,|; then Z = Sia) that 


W}.U2,°°°, Wy iid. from (0,1), and calculate ¥; = w;/¥!/? where ¥ = par w 


is, Z, when k = 1, is the square of the largest component of a random unit vector on an 
n-sphere. We can then get some distributional results for Z by fnding the distribution of 
Stays 

>= 1,n=2. For example. when n = 2, OP can be fixed by the angle @ it makes with 
the z-axis. and @ is uniformly distributed. between 0 and 27, written U(0,27). For the 
distribution of S,,, we can confine attention to the first quadrant. 0 < 6 < }: then when 
z x 


2. Sin) = cos@ and when | < 6 < §. Sin) = sin@. It is easily shown that the 
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density fs(r) of S(y) is given by 
fs(z) : ae 
— _—_— zr 
uaa re. a 


The moments yp, = E(S7)) are ie zr’ fs(x)dz, where c = 1/2; these can be easily 


a1: (7) 


calculated to give 


r 


4 


ul, = 0.90032, yw, = 0.81831, py = 0.75026, pw, = 0.69331. 


: Pay ea 


Check with formula (2). From (2) and (3), with k = 1, n = 2, we have for g > 0.5, 


1 
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P(g) = 


since B(}.4) =. Let r = t? and we have 


P(Z > 9g) = P(5S(2) > ¥9) 


oe 2-1/2 ees 2 ' 
= - Qi — ty dt =2~—sin™' Mg, 9g 20.5. 
v V9 Tv 


- This is the same result as obtained by integrating (7). 


Of course, for n = 2 there is a simple correspondence with the F-test for equality of 
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two variances, since Z~) = 1+ ecco It quickly follows that zq, the critical value of Z 
). 


at level a. is related to Fy.4(a/2 ay 
For n = 3. the algebra is more complicated. Let vector OP. where P is now uniformly 

distributed on the first orthant of the unit sphere (that is. all coordinates of OP are an 

positive), have usual spherical coordinates 6. o. Then = = cos@is U(0,1), ando =U(0. F), nd 
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Par) 


so that. if we use rectangular axes for (@.=). probability is uniform on the rectangle FR : 
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0<2<1.0<o0< §. We now want Pr(maximum component of OP < t) = P(t). This is 


e ~ Ip . 


found as follows. 3 


ke=elln=3.t> +. If z > t, and t > Jp, it is clear that z must be the 
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maximum component, for all ¢. Thus P(z >t) =1-—t. By symmetry, z = sin @cos ¢ or @ 
y = sin§sin o (the other two components of OP) could be the maximum component with ee 
equal probability, so that P(maximum component > t) = 3(1 — ). * t 
Equations (2) and (3). forn = 3.4 = 1.9 > 0.5, give eae 
: 1 j Ph, 
Piz >9)=3 | r7'?dr/B( 5.1) = 3(1 - v9): a : 
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thus P(S,3) > \/g) = 3(1 — Vg). in agreement with the result above. 

eae eg oe 5 oe aes wa Then P(S(3) < t) = P(all 3 components < t); this is 
given by the probability over an area bounded by (1) = =t; (2) V1 — 2? coso =: 
(3) V1 — =z? sing =t, roughly in the middle of rectangle R. Thus 


4 a/4 
P(Sa) <= f t-2)ae 


7 61 


where cos 0, = t/V1—t? and where ,/(i — 2*)coso =t; se 


4 w/a ; 1 1 
P(Sa) <t) == f {t—/1-t?sin* o}dd, Aa, 
® < 


1 
This last expression must be evaluated numerically. 
Similar ideas can be used to give upper tail results for higher values of n, for k = 1, 
but again they lead in the end to integrals which must be evaluated numerically. 
3.4. Accuracy of Table 1. 
Various checks on accuracy have been made for the points in Table 1. In fitting the 


Pearson curves. the fit was made to Z!/?, which, when & = 1, is Sin). The numerator of 


a 
ae 


Sin) is the largest absolute value of a set of n standard normals. and the expectation of 


this variable is known ( Biometrika Tables for Statisticians, Vol. 2), since absolute values of 
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standard normals are used in analysis of experiments when main effects and interactions 


are plotted on half-normal plots (see. for example, Bennett and Franklin, 1954). These a 
ne 
known values enabled a check to be made on the accuracy of (6), for k = 1 and r = 3. 4 
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Also. for k = 1.n = 2. the moments of S;,,) could be compared with the exact values given 
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above. 
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In addition. the exact distributions given above have been used. forn = 2andn = 3. to 
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check the percentage points. A further check was made, for k = 2, by comparing the values 


with those given by Nowroozi (1967). Finally, the most extensive check on the Pearson 
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curve points was made by comparing the points Z, with those given by the exact formula 
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(2), when Z, > 0.5. It was found that the Pearson curve fits were very accurate, with 


occasional differences from the exact values in the fourth, or sometimes the third, decimal 


cate 


place: however, these differences will make negligible error in the a-value corresponding 


to the given point Zq. 
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